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Abstract. We systematically investigate the problem of representing 
Markov chains by families of random maps, and what regularity of these 
maps can be achieved depending on the properties of the probability 
measures. Our key idea is to use techniques from optimal transport to 
select optimal such maps. Optimal transport theory also tells us how 
convexity properties of the supports of the measures translate into reg- 
ularity properties of the maps via Legendre transforms. Thus, from this 
scheme, we cannot not only deduce the representation by measurable 
random maps, but we can also obtain conditions for the representation 
by continuous random maps. Finally, we show how to construct random 
diffeomorphisms from a given Markov chain. 



1. Introduction 

Amongst the main concerns of Dynamics, one usually wants to decide 
whether asymptotic states of a given class of systems are robust under small 
random fluctuations. Such randomness, corresponding to natural fluctua- 
tions in physical processes, are represented by either a Markov chain model 
with localised transition or by a sequence of random maps. To see how they 
arise, consider a discrete-time system / from a given topological space M 
into itself. Suppose at each iteration of / we allow a small mistake of size, 
say, at most e > 0. Then a Markov chain is defined by a family {p e ( ■ \x)} 
of Borel probability measures, such that every p e (- \x) is supported inside 
the ^-neighbourhood of f(x). The orbit of our dynamics subject to such 
small errors is thus given by sequences of independent random variables 
{xj}, where each Xj+i has distribution p e (- \xj). Alternatively, one could 
think of the orbit as being made by the iteration xj = gj o • • • o gi (xq), where 
each measurable gj is picked at random e-close, in a sense to be made more 
precise, from the original map /. Endowing the collection of maps {gj} with 
a probability distribution v e , we say that the sequence of random maps is a 
representation of that Markov chain if for every Borel subset U 

p £ (U\x) = v e {{g : g{x) G U}). (1) 
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In fact, given any sequence of random maps, one can always find a Markov 
chain which is represented by this scheme |BDV051 D.4], see also |ZaH07| : 
the Markov chain is simply given by ([T]), and one only needs to check that 
this satisfies the Markov chain criterion. The converse problem, however, 
is much more subtle, as we shall see. This is exactly the subject of this 
paper. In other words, we investigate under which conditions imposed on 
the Markov chain one can obtain a representation by a random map scheme 
and how its regularity properties are reflected. 

The study of realisations of Markov chains via random maps goes back 
to Blumenthal and Corson [B1C70] . They considered the case where M is 
a connected and locally connected compact metric space under some strong 
requirements on the probability measures. Let us denote by V{M) the space 
of all probability measures on M. In addition, suppose each x H > p{- \x), 
acting from M to V(M), is continuous relative to the weak* topology on 
V{M). Then, if for each x the support ofp{ ■ \x) is all of M, they showed that 
it is possible to obtain a probability measure v on the space of continuous 
transformations of M such that condition (pQ) is fulfilled. Their proof is based 
upon the existence of a continuous projection of the subset of V{M) whose 
support is all of M onto the space of probability measures on an interval 
whose support is the whole interval itself, and its continuous inverse. The 
assumption of full support on the probability measures is essential to assure 
the continuity of the maps. See for example [Kif86| . 

Weakening this condition, Kifer showed that if M is a Borel subset of 
a complete separable metric space (Polish), then any Markov chain on M 
can be represented by a sequence of measurable random maps [Kif86]. His 
idea was to use Borel measurable isomorphisms of M to Borel subsets of 
the unity interval, as it had previously been shown by Kuratowski. Later, 
Quas |Qua91| tackled the case where M is a smooth compact orientable 
Riemannian manifold. He showed that probability families which are abso- 
lutely continuous with respect to the normalised Riemannian measure whose 
density is smooth can be represented by C°°-random maps. 

Afterwards, Araiijo [AraOO] showed how to construct families of C r -diffeo- 
morphisms on the n-torus near an unperturbed C r -diffeomorphism. He took 
advantage of the parallelizability of this manifold and of its quotient by in- 
tegers. Then he used natural projections to identify orthonormal vector 
fields from which he could build these maps C r -close to the original one; 
see [Ara00[ Example 1] . Using a parametrised geodesic flow, he also showed 
the existence of parametrised families of diffeomorphisms, around an unper- 
turbed one, of any compact boundaryless manifold; see [AraOO I Example 2]. 
Nevertheless, his procedures focus on uniformly continuous perturbations, 
requiring the small noise to uniformly cover a ball of positive radius around 
the unperturbed diffeomorphism. Furthermore, they do not yield a family 
of diffeomorphisms from a given probability distribution. 
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More recently, Benedicks and Viana |BeV06[ Example 1.7], and |BDV05t 
D.4] constructed random maps for small non-uniform noise. They lift im- 
plicitly the measure to the tangent space at f(z), then try to transform the 
distribution to a fixed measure on [0, l] n . If this transformation is invertible, 
its inverse regular enough, and varies smoothly with respect to z, then it is 
possible to select a random continuous map representing the perturbation. 
For topological reasons these constructions may fail on manifolds with non- 
trivial tangent bundle. Namely, their constructions implicitly assume the 
existence of global cross-section of the frame bundle. 

In fact, it is not clear how to choose the random maps representing a 
Markov chain, and there might be many possible such choices. In such a 
situation, a basic strategy of geometric analysis is to select the maps accord- 
ing to some optimization principle. This typically has the advantage that 
an object selected by an optimization principle is not just some solution of 
the problem at hand; it typically enjoys additional properties derived from 
the optimization, and these properties can typically be usefully exploited. 
This is also the strategy we adopt in the present paper. Since the maps 
should relate different measures, it is natural to select them by optimizing 
a transportation problem between those measures. Thus, in this paper we 
introduce techniques from optimal transport in order to tackle the repre- 
sentation of Markov chains under different levels of regularity of the maps. 
The paper is organised as follows. After presenting the main definitions in 
Section [21 we review the main ingredients from optimal transport theory, in 
Section [3j to be used in the remaining part of the paper. In Section HI we 
use optimal coupling to prove Theorem [Aj which shows how Markov chains 
can be represented by measurable maps. Then, in Section [5j we use Moser's 
coupling to show how representation by continuous random maps arise; the 
content of Theorem [B] In the following Section [61 we discuss the regularity 
of densities and how they affect the properties of the transport maps. Then 
in the next Section [7J we use tangent bundle lifts of the measures and cer- 
tain transformations to a fixed measure to construct continuous families of 
probabilities in the bundles. To use similar ideas of Araujo we rely on the 
fact that the tangent bundle of a manifold is always contained in a (smooth) 
trivial bundle, what can be seen by taking an isometric (Nash) embedding 
M ^-?> W 1 . Then we lift (in a nice way) the measures from the tangent bundle 
to this trivial bundle to get a measure family {fJ, x }xeM on M n . Using optimal 
transport theory and its regularity theory we get transformations to a fixed 
measure varying continuously with respect to x € M so that we can select 
sections of this bundle varying continuously, such that the distribution at 
a point x represents the measure p, x . (Smooth) Projections to the tangent 
bundle and the exponential map then give us the random continuous maps. 
Assuming further regularity, these maps are differentiable and we obtain 
random maps C 1 -close to /, and the unperturbed map / is a diffeomor- 
phism, we thus obtain a family of random diffeomorphisms. Therefore, we 
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give a geometric condition for the representation of Markov chains by ran- 
dom diffeomorphisms, summarised in Theorem O In a following up work, 
the methods developed here are being extended in order to study statistical 
and stochastic stability. 

2. Some notation and definitions 

In this section we set up the notation and collect some main definitions 
to be used throughout this paper. For a comprehensive presentation on 
random perturbations of dynamics, see |Kif86l lKif88| . We consider here an 
arbitrary manifold, say M, supposing it to be finite dimensional, equipped 
with some Riemannian structure, fixed once and for all, which induces a 
distance du '■ M x M — > K. We call m its normalised Riemannian volume 
form on M, i.e. m{M) = 1, and unless otherwise stated, we take absolute 
continuity with respect to m. As before, let us denote by V{M) the space 
of all Borel probability measures on M. As usual, V{M) is endowed with 
the weak* topology. 

Regarding measurability and continuity, we recall Lusin's theorem to be 
used in our proofs 

Theorem 2.1. |Fed69l Theorem 2.3.5]. Let M be a locally compact met- 
ric space, N a separable metric space, and fi a Borel measure on M . Let 
f : M — > N be a measurable map, and A C M a measurable set with fi- 
nite measure, then for each 5 > there is a closed set K C A, such that 
/a(A\K) < 5 and the restriction of f to K is continuous. 

2.1. Markov chains and random maps. Let N be another arbitrary 
manifold or a separable complete metric space. We shall consider families of 
probability measures (fi x ) x eM in V(N) given by measurable maps x h-> fj, x . 
We speak of a continuous family of probability measures if the maps x \— > \i x 
vary continuously from M to V(N) relative to the weak* topology. In many 
cases one has N = M or N = M. n . Such families are sometimes called 
continuous Markov kernels. Therefore, Markov chains are special Markov 
kernels obeying some conditional probability with localised distribution. 

Similarly, we can have a more general definition for our random maps. 
For a probability space (Q,A,v), consider a measurable collection of maps 
T : n x M -> N, (u, x) i-> f u (x). Then, we call the family {f w :M^ N) w( z U 
random measurable maps. If in addition each map in (/ w : M — > N)^^ is 
continuous, or a diffeomorphism, then we say that it is a family of random 
continuous maps, or random diffeomorphisms, respectively. They are also 
known as random fields. Then, condition ([T]) says that (/oj^efi represents 
(fJ-x)x&M if the distribution of uj i-> f u (x) equals fi x for all x € M. 

3. On optimal transport 

The remaining part of this paper is based upon techniques from optimal 
transport. Our main reference is the book by Villani |Vil09| . For the sake of 
completeness, we sample and collect in this section the concepts to be used 
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along the way. Readers familiar with optimal transport may wish to skip 
this section and only refer back to it when needed. 

The basic problem in optimal transport, as introduced by Monge, con- 
sisted in moving a given distribution like a pile of sand from one place to 
another with a minimal cost. There are several possible ways to generalise 
and tackle this problem. For example, the given mass to be transported 
can be thought of as a distribution in an appropriate probability space. In 
other words, given measurable spaces M, N, and probability measures \i in 
V{M) and v in V(N), we seek for a coupling, or a way to connect these two 
measures. More generally, one has the following definition. 

Definition 1. Let (M, //) and (N,u) be two probability spaces. We couple \i 
and v by constructing two random variables X, Y on some probability space 
(f2,P), such that law(X) = fj,, lawiY) = v. The law or distribution of 
{X,Y) is called coupling of(fM,u). 

In our context, fx and v are the only laws we shall be interested in, so we 
choose = M x N. There are several examples of couplings arising in 
different contexts. 

The first generalisation of Monge's original problem we can think of is 
given in terms of transport maps. That is, given measurable spaces M, N, 
probability measures fi in V(M) and v in V(N), we seek for measurable maps 
T : M N, such that for all Borel E C N one has ^i{T~ l {E)) = v(E). 
This is an example of a so-called deterministic coupling. The requirement of 
a transport map, however, is a strong condition, and this problem may not 
have a solution unless more restrictions are made. The canonical example 
is when /i is a Dirac measure and v is not. 

In order to avoid ill-posed problems, one alternatively should look for 
weak solutions of the transport problem as it has been proposed by Kan- 
torovich. In this case, we focus on probability measures 7 in V{M x N), 
whose projections (or marginals) are (i and v. In other words, let V{jj,, v) := 
V(M x N) to shorten the notation, and consider the canonical projections 
^V{M) '■ r(/i, v) ~> V(M) and ft-pim '■ r(/i, v) — > V(N), then the marginals 
are given by the push-forward tt-p(m)*7 = A* an d ^v(N)*l = v - The Kan- 
torovich minimisation problem consists in obtaining 



where, for a given cost function c : M x N — > [0, +00], the infimum runs over 
the joint probabilities 7 in T(fi, u). The joint probability measures are called 
transport plans, the ones achieving the minimum, optimal transport plans, 
and C(n, v) the optimal transport cost. Thus, this coupling is called optimal 
transport coupling. Obviously, the solution of the Kantorovich minimisation 
problem depends on the choice of the cost function. The following theorem 
guarantees the existence of optimal coupling. 




(2) 
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Theorem 3.1 (Existence of optimal coupling |Vil09[ Theorem 4.1]). Given 
two Polish spaces (M, fi) and (N,u), and a lower semicontinuous cost func- 
tion bounded from below, then there always exist optimal couplings of(fjL,i>). 

Notice that any transport map T : M — > N induces a transfer plan 
7 defined by (Id x T)*/z. In fact, we can also impose conditions on the 
cost function such that the optimal coupling is obtained by a deterministic 
coupling. The search for a deterministic coupling (transport map) T which 
minimises Eq. [2] for a given cost function is called the Monge problem. The 
following proposition ensures the existence of a unique transport map solving 
the Monge problem. 

Proposition 3.2 (Solution of the Monge problem). Let M be a Riemannian 
manifold, X a closed subset of M , with dim(dX) < n— 1 and y an arbitrary 
Polish space. Let c : X xy ^-M be a continuous cost function, bounded from 
below and assume that for the probability measures fj, € V(X) and v € V(y), 
the optimal cost C((j,,u) is finite. If the following conditions are fulfilled 

i) c is differ entiable everywhere; 

ii) n is absolutely continuous; 

Hi) V x c(x, ■) is injective where defined, i.e., ifx, y, y' are such that S/ X c(x, y) = 
V x c(x,y'), then y = y' , 

then there exists a unique (in law) optimal coupling (x,y) of (fj,,u), and it 
is deterministic. 

Proof. The proof follows from Theorem 10.28, Proposition 10.7, and Remark 
10.33 of [Vil09] . □ 

Corollary 3.3. Suppose that for each ifNioe have a sequence of contin- 
uous cost functions Ck '■ X x y — s> R converging uniformly to c : X x y — s> R ; 
where c is defined as above. Let (ffc)fceN be a sequence of probabilities on 
y converging weakly to v € V(y), and assume that for each k there exist 
measurable maps T/% : X — >■ y, such that each is an optimal transport 
map between \i and v^. Then converges to T in probability, i.e., 

Ve > fi[{x e M;d{T k (x),T(x)) > s}) ► 0. 

k— >oo 

Proof. The proof follows from Proposition 13.21 above and |Vil091 Corollary 
5.23]. □ 

Corollary 3.4. Let M = R n and c(x, y) = —x ■ y. Consider two probability 
measures fx, v on M , such that fi is absolutely continuous, then the solution 
of Monge 's problem can be written as 

y = T(x) = x + V^(x), 

where ip is some convex, lower semicontinuous function. 

Proof. The proof follows from Theorem 10.44, Particular case 10.45, and 
Particular case 5.3 of |Vil09j . See also Section [6] below. □ 
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Remark 1 . Proposition \3.2\ is a slight variation of a more general theorem 
about the solution of the Monge problem. Its conditions can be weakened or 
replaced in a number of ways; see [Vil09l Theorem 10.28]. 

When the cost function is given in terms of distances in a metric space, 
we can use ([2]) in order to define a distance between measures. 

Definition 2 (Wasserstein distances). Let (M,d) be a Polish metric space, 
and p € [0, oo). The Wasserstein distance of order p between any two prob- 
ability measures p,v on M is given by 

W p (p J ,u)=( inf [ d(x,T(x)) p d 7 (x,T(x))) ^ . (3) 
\~/er(n,v) Jm J 

Using the Wasserstein distances we can define a space given by the re- 
striction on V{M) x V(M) on which W p takes finite values, or the space of 
probability measures with finite moment of order p. 

Definition 3 (Wasserstein spaces). The Wasserstein space of order p is 
given by 

P p (M) = jp S V{M) : J d(x ,x) p p(dx) < ooj . (4) 

The choice of xq is arbitrary and the space does not depend on this. 

As a last important example we shall mention the powerful Moser cou- 
pling |Mos65llVil09] . 

Theorem 3.5 (Moser coupling). Consider a smooth compact Riemannian 
manifold M and its volume form m. In addition, consider Lipschitz contin- 
uous positive probability densities po and p\ on M . Then there is a deter- 
ministic coupling of po = p$m and p\ = p\vn. In other words, there exists a 
measurable map T such that for all Borel E C M , we have pi(E) = T*po(E). 
Furthermore, if po,Pi o,re C k,a then T is C k+1,a . 

Remark 2. The map T is explicitly given, for each x E M , in terms of a 
solution of the elliptic equation 

Au(x) = po - pi, 

where A denotes the Laplace operator. The transport map is obtained by 
defining the locally Lipschitz vector field 

cu , Vu(x) 



(1 - t)p (x) +tpi{xY 



which integrates to the flow (T t (x))o< t <i with an associated family of mea- 
sures (pt)o<t<i- In particular, the time-1 map pushes forward po to p\. 
See [Vil09j for more details. 
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4. REPRESENTATION BY MEASURABLE RANDOM MAPS 

From this section we begin to apply the techniques from optimal transport 
just presented. We shall start by tackling the problem of representing a 
Markov chain by measurable continuous maps. Our result implies Theorem 
1.1 by Kifer [Kif86l Ch. 1]. We also treat the case of maps between different 
spaces. The main result of this section is 

Theorem A. Let N be a manifold and consider a measurable family of 
probability measures (n x )xeM with finite p-moments for p > 1, i.e. for some 
yeN 

SUp W p (6 y ,fJL x ) < 00. 

Then there exist random measurable maps (f w : M — > N)^^ representing 

Proof. Let v be any probability measure absolutely continuous with respect 
to some volume measure on N, such that W p (i/,S y ) < oo. Then Propo- 
sition 13.21 shows that for each x € M there is a unique optimal coupling 
realised by a (measurable) transport map T x : N — )■ N, i.e. 

W p (u,v x f = J d(y,T x (y)fd^ Q (y). 

Moreover, by Lusin's theorem and Corollary 13.31 these maps a.e. vary con- 
tinuously with respect to x € M in the topology of convergence in probability 
with respect to v. Setting (n,r) = (N,u), we can take for some random 
variable X : f2 — > N with law(X) = v, the restriction, 

Ux)=T x {X(u)). 

Because of continuity with respect to convergence in probability we can 
assume without loss of generality that (/^) w gn is measurable and separable. 

□ 

5. Representation by continuous random maps 

In this section we apply optimal transport and regularity theory to give 
conditions for the representation of Markov chains by continuous random 
maps. We start with the following result. 

Proposition 5.1. Let (n x )xeM be a continuous family of probability mea- 
sures. Suppose for a fixed measure v £ V(N) with compact support there 
exists a family of continuous maps (T x : suppz^ — > N) x ^m varying continu- 
ously in the C° -topology such that 

(T x )u = (j, x . 

Then (/J. x )xeM can be represented by random continuous maps (fu)ueii such 
that (fi,r) = (suppz^, v). 

If, in addition, T x varies Holder or Lipschitz continuously with respect to 
x 6 M then so does f u with the same constants (resp. exponents). 
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Proof. Set (fi,r) = (suppz^, v) and let X : Q — > f2 be any random variable 
such that lawX = v. Define 

Ux) = T x {X{u)). 

By construction, (fu)uen represents (p x ) x eM- We need to show that f w : 
M — > N is continuous. Since T x varies continuously with respect to the 
C°-topology and its domain is compact we have for a fixed T x 

d{T x (X{u)),T y (X(uj)) < d c o(T x ,T y ) < e 

whenever d(x, y) < 5 for sufficiently small 5 = <5(e, x) > 0. □ 

Remark 3. Proposition \ 5.1\ above generalises Example 1.7 by Benedicks- 
Viana |BeV06j . where v is the Lebesgue measure on [0, l] n and T x is the 
inverse of a rearrangement S x of the positive measure p x S V([— e,e] n ) to 
[0, l] n assuming S x is a homeomorphism which varies continuously in C°. 

Remark 4. To get random continuous maps it is actually enough to assume 
that x i y T x is pointwise continuous v-a.e, i.e. if x — > y then for v-a.e. u) 
we have T x {oj) — > T y (uj). 

Remark 5. Notice that the assumption on the continuity of each T x could 
be relaxed by using Lusin's theorem. Furthermore, using a general version 
of the Kolmogorov-Chentsov continuity lemma for random fields obtained 
in [Pot09] . a similar result holds if we only assume that each T x is Borel 
measurable converging fast enough as x — >■ y in the topology of convergence 
in probability with respect to (£1,t). 

For the following main theorem we apply Moser's coupling. Our theorem 
directly implies the main result of Quas |Qua91| . 

Theorem B. Assume N is a compact Riemannian manifold without bound- 
ary with normalised volume measure m. Let (fi x ) X £M be a continuous family 
of probability measures such that each [i x , absolutely continuous with respect 
to m, has a positive Holder continuous (for some exponent a > 0) probability 
density varying continuously with respect to x G M. 

Then (li x )xeM can be represented by random continuous maps {fj)uen- 

Proof. Let fj, x = p^dm. Then, by Theorem 13.51 there exists a coupling of 
(m, n x ) induced by the time-1 map of a Lipschitz continuous vector field 
varying continuously with respect to x € M. It is given by the solution of 
the following elliptic equation on N 

Au^ = 1 - p^. 

By elliptic regularity theory, vS x ' is C 2,a so we can define the vector field on 
N 
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According to our assumptions it is well-defined and Lipschitz continuous 
and thus integrates to a flow T x ,t '■ N —> N . Furthermore, we have 

(T x ,i)*m = n x . 

Since the densities vary continuously, so does the vector field and thus the 
map T x i which implies by Proposition 15. II that (n x )xeM can be represented 
by random continuous maps (fu)uen- D 

6. Regularity of densities 

The next step is to establish conditions on representations of Markov 
chains by random diffeomorphisms. Before that, we shall again take up 
the discussion on optimal transport and its regularity properties applied to 
the regularity of the densities of Markov chains. We will focus on families 
of probabilities on W 1 . After that, we show how to use the results of this 
section on Riemannian manifolds via lifting and embedding techniques. We 
start by showing how convexity of the support of measures is related to the 
regularity of the transport maps. 

Consider a potential 4> : U C R n ->■ R U {+oo}, and define at y € U' C W 1 

(f = sup(-c(x,y) - 4>{x)). 

Furthermore, let us define 

G^x) = {y£U': <f>(x) + 4> c {y) = -c(x, y)}, 

then we can state the following general result, taken from [Loe09l Theorem 
2.7]. 

Theorem 6.1. Let U,U' C M. n be bounded domains ofM n , and c(x,y) = 
—x ■ y. let fx, and v be probability measures on U , and U' . Assume that 
H does not give mass to sets of Hausdorff dimension less than or equal to 
n — 1. Then there exist a [i-a.e. unique T solving the Monge problem for 
this cost function c. Moreover, there is a convex potential (ft on U , such that 
T = G^. Finally, if tp is convex and satisfies = v, then Vtp = Vcj), 

fi-a.e. 

In fact, for the cost function c(x,y) = —x ■ y, one can show that eft = ip c , 
and that they are Legendre transforms of each other. See Particular case 
5.3, and Definition 5.7 in [Vil09| . 

Regarding the regularityf of (ft, Loeper proves the following result that we 
shall use in the sequel. 

Theorem 6.2 ( |Loe09t Theorem 3.4]). Let U,U' C M n be bounded, strictly 
convex with respect to each other. Suppose fi and v are probability measures 
on U and V C U' , respectively, with V being convex with respect to U . 

H{B e (x)) < C^ 1- ^ for alls > and xEU 
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and v > nm n on V for some k > 0. Assume that for the convex potential 
4> and cost function c(x,y) = —x ■ y we have = v. 

Then 4> is continuously differentiable on U$ = {x £ U \ d(x,dU) > 5} with 
Holder continuous derivatives for 5 > 0. In particular, for some f3 6 (0, 1) 
and C depending only on U, U', m > 0, 5 > 0, p and C^ 

HWc^HUs) < c - 

If, furthermore, fi is supported on some V compactly contained in U then 
G$ is a Holder continuous map with Holder norm bounded by C. 

Consider some compact K C W 1 , and assume (n x )x€K to be a family of 
measures, such that each [i x satisfies the assumption of fj, in Theorem 16.21 
Then it follows that we can define optimal transport maps S x between \x x and 
v = freifoun. These maps are Holder continuous with Holder norm bounded 
by a constant only depending on some compact convex neighbourhood Q x 
of the support of [i x . In particular, by uniqueness of the optimal transport 
map, we have 

Lemma 6.3. Let {^ x ) x <^k be a continuous family of probability measures on 
some compact set K C W 1 . Suppose that each \i x satisfies the assumptions 
of Theorem Ifi.gl with C = independent of \i x and all supports contained 
in some convex U C K n . If we assume that the supports of [i x are contained 
in D = cl(Us) for some 5 > then the optimal transport maps S x between 
fi x and v vary uniformly, that is if then S x : D — > U varies in the 

uniform topology ofC°(D,U ). 

Proof. Because the || ■ H^i^-norm of {4> x } x <=k is uniformly bounded, the set 
is pre-compact in C 1 (D). Since up to a constant the potentials are unique 
we can assume 4> x (yo) = for all x 6 K and some yo G D. The limit 
of lim n _ i , 00 (p Xn — >• (f) XQ for some x n — > xq is also a convex potential solving 
the optimal transport problem with 4> xo (yo) = which implies (p xo = (j) XQ . 
Therefore {4> X } X £K is already closed and thus compact and S x : D — > U' 
varies continuously in the C°-topology as x varies in K. □ 

Proposition 6.4. Let (n x ) X £K C V(M. n ) be as above. In addition, assume 
each (i x is supported on a convex set and has strictly positive Lebesgue density 
on its support, i.e. fj, x > k ■ m for some n > independent of x, then 

£>x\ supp fl x 

: supp/Xz -> [0, l] n 

is continuously invertible. Furthermore, if the supports vary continuously 
with respect to the Hausdorff metric on (compact) subsets of W 1 then the 
inverse T x = 5" 1 : [0, l] n — > supp \i x C U varies continuously in the 
C°([0, l] n , U). In addition, such a family (/J, x )xex can be represented by 
random continuous maps. 

Remark 6. If we do not assume that the supports vary continuously then 
it is still possible to show that the maps T x converge pointwise on [0, l] n to 
T y as x converges to y in K. 
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Proof. By the previous result S x varies continuously in C°(-D,K n ). Then 
Theorem 16.11 implies that the optimal transport problem from v to p x has 
a unique continuous optimal transport map T x : [0, l] n — > supp p x which, 
by Corollary 13.41 is the derivative of a differentiable convex potential ip x . 
Because ip x is (up to a constant) the Legendre transform of the potential 
4> x , whose derivative is S x , we necessarily have T x = S" 1 on [0, 1]". 

To prove that x i— > T x is continuous it is sufficient to show that the 
graphs converge with respect to the Hausdorff metric. Because x i— > S x is 
continuous and the supports of p, x vary continuously with respect to the 
Hausdorff metric, the restricted graph 

9r(S x ) = {{y,S x (y)) e supp^ x [0,1]™} 

is continuous with respect to the Hausdorff metric on subsets of M n x M. n . 
But this implies 

x H- gr{T x ) = (gr(S x ))~ l 
is continuous and thus x \— > T x is continuous as well. 

In particular, Proposition 15.11 implies that any such family (p x ) X £K can 
be represented by random continuous maps. 

□ 

Proposition 6.5. Suppose that p, is absolutely continuous with respect to 
Lebesgue measure m, and v = mirou™. Suppose further that both satisfy 
the assumptions of p and v , respectively, as in Proposition \5.1[ Then S is 
continuously invertible on U , where U is the set of point z which admit a 
(convex) neighbourhood where the density of \x is strictly positive. In partic- 
ular, this holds in {p > 0} = int(supp fi) if the density p of fi is continuous. 

Proof. Let S be the (Holder) continuous optimal transport map from Re- 
mark [3] of Proposition 15.11 Since S and T = S^ 1 (as multi- valued map) are 
the gradients of convex potentials <ft an d Y>, which are the Legendre trans- 
forms of each other, it suffices to show that T is single- valued on S(U). By 
definition we have 

s ( x ) = {y I H x ) + ip(y) = x ■ y} 

and 

ip(y) = sup{xy - 4>{x)}, 
which implies that S~ l (S(x)) is convex. So if we show that T is one-to- 
one on S(V X ) for some small neighbourhood x 6 U then it implies that 
S^ 1 (S(x)) n V x = x, i.e. S and T are both single- valued and thus continuous 
on resp. U and S(U). 

Let 7r be the optimal transport plan between v and p, in particular we 
have 

(id xT%f = 7r. 

Let x be a point in U and V x be a closed neighbourhood with the pre- 
scribed property. By the restriction property for optimal transport plans 
|Vil09l Theorem 4.6] the plan Tri^n xVx is an optimal transport plan between 
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its marginal, i.e. between some v < v and p = p\y x - Obviously this plan is 
also induced by T, i.e. 

(id xT,)i) = tt. 

The measures v and p satisfy the assumptions of Proposition 15. 1| which 
implies that T is (Holder) continuous. Therefore T is one-to-one on supp v = 
S(V X ). □ 

Corollary 6.6. Let p and v be as above. Assume the support of p is strictly 
convex and 

U = {p > 0} = int(supp^) 
where p is the continuous density of p. Then the optimal transport map T 
from v to p is a continuous map from suppz/ to supp p. 

Proof. Restrict S to supp p. By the previous theorem S is injective in the 
interior of its domain. Furthermore, S~ 1 (S(x)) is convex. Because the 
support of p is strictly convex this also implies 5 is injective on the boundary, 
i.e. if x' G S~ 1 (S(x)) for x 6 <9supp/i then \x' + (1 — \)x € dsuppp which 
implies x = x' . 

Because S is one-to-one on its (convex) domain supp/U and S(suppp) = 
suppz/, so is its inverse T on suppzA Thus T is continuous. □ 

The two previous results show that problems happen if the supports are 
not strictly convex or not positive everywhere. Furthermore, if the support 
of p is not simply connected (e.g. has a hole) then this hole is an isolated 
singularity of T in the interior of the support of v. Other phenomena like 
disjoint support might look like complete cut of the support of v. 

Proposition 6.7. Let K be some (compact) set and B\ be the closed unit 
ball in W 1 and (f x : B\ — > f x (B\) C M n ) xe ^ be a family of diffeomorphisms 
(onto their images) varying continuously in C 1 w.r.t. x. Assume (p x )xGK 
is a continuously varying family of measures supported on the images of f x , 
i.e. 

suppp x = f x {Bi). 

Lf each p x is Lebesgue regular with L°° (resp. continuous) density then 
there is a (unique) continuously varying family {v x ) x& k of Lebesgue regular 
measures with L°° (resp. continuous) densities with {f x )*v x = p x . 

Furthermore, if every point in the interior of the support of p x admits a 
neighbourhood such that the Lebesgue density of p x is strictly positive in the 
interior of the support then the same holds for v x . 

Proof. Assume dp x = p x dm and there is some (Lebesgue regular) family 
dv x — $ x dm such that {f x )*v x = p x . By the Jacobian equations we have 

&x(z) = Px(fx(z)) ■ Jfjz). 

So defining ^(z) as above gives us u x . 

Because {f x ) x ^K are continuously varying diffeomorphisms the Jacobians 
Jf x {z) vary continuously with respect to (x, z) G K x B\ which implies the 
statement of the proposition. □ 
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Corollary 6.8. Assume (hx)xgk is a continuous family of measures and 
the supports admit diffeomorphisms as above. Then there exists a family 
of continuous maps T x : [0, l] n — > W 1 varying in the C° '-topology and the 
following holds 

(T x )*m\[o t i]n = fi x . 

In particular, all such families (n x )xeK can be represented by random 
continuous maps (fu)uj£n- 

Proof. Follows from Proposition 16.41 Corollary 16.61 and the previous propo- 
sition. Finally the last statement follows from Proposition 15. 1[ □ 

Remark 7. The corollary can be applied if the supports are star-shaped with 
differentiably varying centre and radial function, i.e. there are z x G B x = 
supp// x and differentiate maps 

r x : §n-i (0)Qo) 

such that x i— y (z x , r x ) is continuous from K to W 1 x C 1 (S n_1 , (0, oo)) (con- 
tinuity of x h-> z x is enough to show that Jf x (z) is continuous in x ). 
The diffeomorphisms f x :B\^ W 1 are constructed via 

fx{(a,p)) = z x + {a,r x (a) ■ p). 

(For simplicity we mixed polar coordinates (ct,p) with Cartesian z x ). 

7. Lifting measures 

In the previous section we constructed random maps for a family of mea- 
sures on M. n . In order to use this result also for a family of measures on 
a manifold we need to lift these measures to measures on W l . The idea is 
to use the local equivalence via the exponential map of a neighbourhood 
of a point p € N and a neighbourhood of of the tangent space at that 
point [Josllj . In other words, the map 

x H- fi x G V(N), 

implicitly lifts to a map 

x ^ Jx x G V(T f{x) N), 

for some continuous map / : M —¥ N, e.g. the centre of mass. In general, 
/ is given by the unperturbed system if one considers a (bounded) random 
perturbation such that 

(expj^-j = jl x . 

In |Ara00l Example 1] Araujo used a similar construction for uniformly 
distributed perturbations and in case the tangent bundle is parallelizable, 
i.e. 

TN = N x R n . 

Notice in that case x i— )■ fi x can be considered as a pair of maps 
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For the second factor there is a random variable X such that the exponential 
map gives naturally the distribution of random maps around / on N, i.e. 

f u (x) = expj^pf^O)). 

More generally, if there is a trivial bundle F = N x ]R fc with a local 
equivalence p : F — > N of a neighbourhood of of Fft x \ to the manifold (or 
the tangent bundle at Tj^N) such that we can lift the measures to 

p x e V(F f ( x )) 

with {p x )x&M satisfying the assumptions of the previous section and 

P*px — P'X 

then there is a family of random maps representing (p x )xeM- 

Since the exponential map at a point p € N is a local diffeomorphism 
between a neighbourhood of that point and a neighbourhood of in the 
tangent space at p, we get the following lemmata. 

Lemma 7.1. Let (p x )xeM be a continuous family of probability on a mani- 
fold N . Suppose that there is a C -map f : M — >• N , for r > 0, such that for 
each x the support of p x is contained in a sufficiently small neighbourhood 
Uf(x) off(x). 

Then (p x )x£M lifts to a continuous family of probability measures (p x ) X £M 
on TN with p x supported on Tf^N (considered as a subset ofTN). 

Lemma 7.2. //, in addition to the assumptions above, p x is absolutely con- 
tinuous with respect to a volume form on N then p x is absolutely continuous 
with respect to the Lebesgue measure on Tj^N (and by equivalence to the 
standard Lebesgue measure on W 1 ). 

Also, if N is compact then the the Lebesgue densities of p x and p x ar ^ 
comparable in the sense that they have the same growth conditions, Lipschitz- 
or Holder-constants or positivity properties of the density in the interior of 
their support. And, in particular, (strict) convexity of the support of p x is 
preserved if the support is contained in a ball around f(x) with radius less 
than the convexity radius of N . 

As a final step we need to construct a trivial bundle with a natural projec- 
tion and show that the measures lift again. As it is well know, the tangent 
bundle TN is a sub-bundle of the trivial bundle F = N x M fc for some k > n 
with a natural projection 

7T : F -> TN, 

which is a linear projection from M k to R n at each fibber at p £ N jJosllj . 

Suppose we have a fixed measure v on M. k which has a smooth density with 
respect to the Lebesgue measure on R fc , such that for any linear projection 
r : M fc — > M. n the measure 

(r) 

v K ' = r*v 

is absolutely continuous with respect to the Lebesgue measure on W 1 with 
positive density inside the interior of its support. Then we claim that we 
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can lift any Lebesgue regular measure jl on M n with support in the interior 
of the support of r*u to a Lebesgue regular measure jl on M. k . 

To show this, notice that jl is absolutely continuous with respect to v^ r \ 
i.e. 

djl(x) = g{x)dv^'\x) 

then defining 

djl{y) = g{r{y))dv 

gives the required measure jl. Obviously if the support of ji is convex so is 
the support of jl and positivity in the interior also lifts in this way. 

8. Representation by random diffeomorphisms 

As a result of the conditions of representation of measures by continuous 
maps, and the lifting properties of measures, we have proved the following 
theorem giving conditions to represent Markov chains by a family of random 
diffeomorphisms . 

Theorem C. Let M and N be compact Riemannian C r -manifolds without 
boundary, and let m be the normalised volume measure on N . Let (fJ, x )xeM 
be a continuous family of probability measures on N such that each fi x is 
absolutely continuous with respect to m, has positive density and convex 
support. Suppose that there is a C r -diffeomorphism f : M — > N , for r > 1, 
such that for each x, the support of fi x is contained in a sufficiently small 
neighbourhood UfM of f{x). Then (fJ> x )xeM can be represented by a family 
(fu)wen of C r -random diffeomorphisms. 

Recall that the space DifF (M, N) of diffeomorphisms is open in C r (M, N), 
for r > 1 [Hir76j . Therefore, what we need to do is to perturb / in the C r 
topology in a controlled way, according to our measure \x x . Indeed, as dis- 
cussed in Section [TJ it is possible to lift the measures on N to its tangent 
bundle via the exponential map and we can identify TN with a subbundle 
of a trivial bundle N xM 1 [Josllj . Then Lemma 17. 1\ and Lemma 17.21 pro- 
vide a natural way of constructing a continuous family of probabilities on 
the bundles according to (fi x ) x eM on N. Furthermore, the construction of 
a trivial bundle shows a natural way of embedding, thus lifting again this 
measures to some M. k , such that the results of Section [6] can be applied. In 
other words, we can continuously select sections of the bundles. Thus, we 
choose maps C r -close to / according to \i x . This completes our argument. 
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